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\ ABSTRACT 

Errors  associated  with  several  boundary  encoding  schemes 
are  discussed  in  terms  of  average  errors  encountered  when 
using  the  schemes  directly  for  measurement  of  the  lengths  of 
arbitrarily  directed  straight  lines.  Ways  in  which  the 
measurement  errors  may  be  diminished  are  examined;  the 
simplest  is  to  make  an  allowance  for  the  number  of  corners 
that  appear  when  a line  is  represented  by  a U-way  code. 

The  efficiencies  of  the  various  coding  schemes  are  examined 
and  the  U-way  code  is  found  to  be  the  most  efficient  of  the 
close-neighbors  coding  schemes,  although  efficiency  can  be 
further  increased  by  use  of  a generalized  code.v 
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1.  INTRODUCTION 

The  detection  of  object  boundaries  is  fundamental  to  most  methods 
of  picture  processing  and  optical  pattern  recognition.  Encoding  these 
boundaries  as  a numerical  sequence  for  subsequent  analysis  inevitably 
results  in  sampling  errors.  The  magnitude  of  the  errors  will  depend 
on  the  coding  scheme  employed,  although  the  significance  of  the  errors 
will  depend  on  the  use  to  which  the  data  are  put  and  a general  analysis 
of  the  effect  of  the  errors  may  not  be  possible.  Nevertheless,  an 
appreciation  of  the  errors,  together  with  other  considerations,  may  well 
determine  which  coding  scheme  is  most  advantageous  in  particular 
circumstances . 

We  are  concerned  here  with  errors  made  in  determining  the  length 
of  a straight  line  or  edge  directly  from  its  coding  sequence.  There 
are  certainly  other  ways  of  determining  the  length  of  a straight  line, 
of  which  the  most  obvious  is  to  determine  its  end-points  and  calculate 
the  distance  between  them.  However,  this  calculation  may  not  be 
practicable  in  certain  situations,  e.g.,  if  the  line  were  only  "nearly" 
straight ; and  it  would  be  grossly  in  error  for  highly  curved  lines 
which  are  dealt  with  in  the  accompanying  paper  [l ] . 

Coding  schemes  are  conveniently  described  by  the  number  of  different 

possible  direction  vectors  which  may  be  taken  from  a point.  The 

schemes  first  analyzed  here  are  the  l»-way  and  8-way  codes  extensively 

discussed  by  Freeman  [2],  which  are  the  most  commonly  used  since  they 

arise  naturally  if  a line  or  scene  is  viewed  against  a square  grid,  and  ^Section" I 

the  6-way  code  which  arises  if  a hexagonal  grid  is  used.  The  greater 

the  number  of  direction  vectors  allowed  from  a point  by  a coding  scheme,  ' ■ 

the  more  accurately  may  an  arbitrary  line  be  coded  and  the  more  accurately  MU)f  J00f$ 

d/or  SPEClAi 

may  its  length  be  measured.  Coding  schemes  allowing  more  than  8 j 


■fc 
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directions,  although  sacrificing  a sequence  consisting  only  of  nearest 
neighbor  points  on  a square  grid,  are  dealt  with  when  we  consider  the 
ways  of  reducing  the  metrication  errors  of  1*-  and  8-way  coding.  We 
show  that  the  average  error  in  the  measurement  of  length,  which  is  about 
21%  for  uncorrected  4-way  coding,  may  be  reduced  to  zero  by  a simple 
algorithm. 

Finally  we  examine  the  efficiency  with  which  a line  may  be  encoded, 
in  terms  of  the  number  of  stored  bits  required  by  the  coding  scheme. 

This  analysis  shows  that  the  4-way  code  is  the  most  efficient  of  the 
close-neighbor  codes,  quite  apart  from  its  metric  corrigibility . 

2.  METRICATION  ERRORS  IN  4-WAY  CODING 

Let  a long  but  finite  line  of  length  i.  be  placed  in  an  arbitrary 
orientation  on  a square  grid.  Let  one  end  of  the  line  be  the  origin 
of  co-ordinates  and  suppose  that  the  line  is  of  sufficient  length  that 
| its  other  end  can  be  considered,  without  significant  error,  to  lie  on 

a grid  point.  Let  the  inclination  of  the  line  to  the  x-axis  be  0. 

! 

If  the  line  is  encoded  by  the  4-way  scheme,  it  is  represented  by 
a sequence  of  grid  points  {s^},  0*i$n  (Fig.  l) . (To  avoid  ambiguity 
we.  take  the  sequence  {s^}  all  lying  on  one  side  of  the  line.  The 
results  which  follow  would  be  unaltered  if  the  sequence  were  on  the  other 
side  or  were  straddling  the  line.)  The  path  from  sq  to  sn  along  grid 
lines  is  then  the  representation  of  the  arbitrary  line  under  4-way  coding 
and  the  length  of  this  path  is  what  we  refer  to  as  the  length  obtained 
directly  from  the  coding  sequence.  Its  value  is  obviously  n if  the  grid 
element  is  of  unit  length. 

Define  the  relative  error  in  the  measurement  of  the  length  of  an 
arbitrary  straight  line  inclined  at  0 to  the  x-axis  as 


e(0)  = (measured  length  - true  length)  / (true  length). 


In  the  case  of  4-way  coding  it  is  evident  from  Fig.  1 that  n,  the 
measure  of  the  line  OP,  is  equal  to  OS  + SP  = l (cos 0 + sine)  so  that 

6^(0)  B (cose  + sine)  - 1, 

It  follows  that  e(e)  s o for  6 * 0 or  2 hut  has  the  value  O.UlU  for 

IT 

6 = ¥.  If  all  values  of  6 are  equally  probable,  the  average  relative 
error,  e^,  is  readily  obtained  from  the  integral  of  e^(0 ) over  all 
angles.  Because  of  symmetry,  the  integral  can  be  restricted  to  the 

IT 

range  OsflsT  so  that 


tt 


TT 


= 0.273.  (1) 

It  is  also  of  interest  to  consider  the  standard  deviation,  o,  associated 
with  measurements  at  many  angles  of  the  length  of  the  line.  The 
standard  deviation  is  obtained  from  the  variance  of  the  measurements 
defined  by 


from  which 

ak  =*  0.124. 

Not  only  is  there  a large  error  associated  with  measurement  of 
length  from  the  sequence  obtained  by  use  of  4-way  coding,  but  also 
there  is  a large  standard  deviation  associated  with  a series  of 
measurements  of  lines  of  the  same  length  but  differing  orientations. 


These  large  values  arise  because  of  the  large  angle  between  the 
direction  vectors  available  in  4-way  coding. 


3.  METRICATION  ERRORS  IN  6-WAY  AND  8-WAY  CODING 


It  is  to  be  expected  that  as  the  number  of  direction  vectors 
available  from  the  coding  scheme  increases , the  error  and  standard 
deviation  of  the  measurement  of  length  will  decrease.  Using 
corresponding  geometrical  constructions  to  that  of  Fig.  1 and  the 
method  of  the  previous  section,  it  is  readily  shown  that  for 


6-way  coding  we  have 


- 1 = 0.103 


so  that 


2 2 , /3  12 
a,  = — + — - - — p 

6 3 IT 


a,  = 0.01*6  ; 


for  8-way  coding  we  have 


aft  = 7T  + - 

o 2 v 


= -0.100 


so  that 


ag  = 0.088. 


In  the  case  of  8-way  coding  it  is  to  be  noted  that  the  diagonal 
step  is  counted  as  being  of  unit  length.  It  is  possible  to  count  the 
diagonal  step  as  being  of  length  y/2  — the  scheme  can  be  designated 
8 (/2)-way  coding  — but  this  is  a particular  case,  2-sampling,  of  some 
coding  schemes  considered  below. 

With  regard  to  6-  and  8-way  coding,  it  can  be  seen  that  although 
their  average  relative  errors  and  standard  deviations  are  less  than  those 
of  4-way  coding,  both  remain  relatively  large. 


5. 


k. METHODS  OF  REDUCING  THE  AVERAGE  METRICATION  ERROR  IN  U-WAY  CODING 


lA 


Correction  of  grid  unit 

If,  for  an  arbitrary  straight  line  encoded  by  the  4-way  scheme 
the  grid  unit  is  taken  to  be  ■J-  (=  O.T85)  instead  of  its  actual  value 
of  1,  the  average  relative  standard  error,  e^,  is  reduced  to  zero. 

However,  the  standard  deviation  of  a series  of  measurements  is  reduced 
only  by  the  same  factor  jj-  and  so  takes  the  value  0.098,  i.e.  nearly  10/6 . 
Nevertheless,  although  this  correction  method  leaves  the  standard 
deviation  unacceptably  large  for  the  measurement  of  random  straight 
lines,  it  is  very  convenient  for  measuring  the  length  of  closed  non- 
noisy  curves,  for  which  the  standard  deviation  falls  to  very  low  values  [1  ]. 

Corner  counting 

In  a manner  of  speaking,  the  error  in  the  measurement  of  length 
of  an  arbitrary  line  or  curve  by  4-vay  encoding  comes  about  because 
the  coding  scheme  introduces  so  many  corners  (Fig.l)  and  the  question 
arises  of  whether  allowance  can  be  made  for  this.  Alternatively,  one 
could  argue  that  to  reduce  both  the  average  relative  error  and  the 
standard  deviation  to  minimal  values,  two  parameters  will  be  required 
and  since  the  only  two  quantities  immediately  in  evidence  are  the  grid 
steps  and  the  number  of  turns  (corners)  made  on  the  grid,  these  must  be 
incorporated  into  a correction  algorithm. 

Referring  to  Fig.  1,  in  the  sector  0^6$^-,  the  number  of  grid  steps 
taken  is,  as  before ,2. (cos 6 + sin6)  and  the  number  of  corners  encountered 
is  (22sin0  - 1).  If  2 is  large  enough,  the  latter  can  be  approximated 
by  22sin0 . Nov  count  each  grid  step  as  a and  for  each  corner  deduct 
a quantity  6 from  the  total  measurement . Then  the  relative  error  in 


6. 


measuring  directly  the  length  of  a line  becomes 

e^(e)  (a, 6)  = acosS  + (a  - 2f5)sin0  - 1, 
and  the  standard  deviation  is  obtained  as  before  (eq.  (2)).  By 
standard  procedures , choose  a and  6 so  that  the  average  relative 
error  is  zero  and  the  standard  deviation  is  a minimum.  This  yields 

a = 7t  ( 1 + /2)/8 

= 0.948 

and  0 = tt/8/2 

= 0.278. 

With  these  values,  the  average  relative  error  is  zero,  as  required, 
and  the  standard  deviation  has  the  value  0.023.  Although  a standard 
deviation  of  just  under  2\1o  is  not  negligible,  it  can  be  tolerated  in 
many  practical  situations . 


m- sampling 

Another  way  in  which  length  measurement  may  be  improved  is  that 
which  we  call  m-sampling:  after  the  line  is  4-way  encoded,  every  mth 
point  is  selected  and  the  length  of  the  line  is  taken  as  \ d(m) . 


J=o 


where  d(m) . is  the  geometric  distance  between  s.  and  s,.  . , , 
J JE  (j  + l; 

Fig.  2 shows  an  example  of  6-sampling. 


m 


In  the  m-sompling  of  a straight  boundary,  the  vectors  from  one 
selected  point  of  the  sequence,  s ^ , to  the  next  one,  s^  + m>  can  have 
only  a limited  number  of  values.  This  is  illustrated  in  Fig. 3 for 
vectors  in  the  first  quadrant.  If  OP  = 0Q  = m,  then  the  ends  of  the 
vectors  will  fall  on  the  line  PQ.  Furthermore,  if  0A  is  a typical 
vector,  inclined  at  angle  <)>.  to  the  x-axis , then  OB  + BA  = m;  and  if 

i 

BA  = i (where  i,  like  m,  is  an  integer  ) then  OB  = m - i and  <(k  = tan  m - i . 
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The  next  available  vector  in  Fig.  3 is  OC,  where  DC  = i+1  and  the 
angle  of  that  vector  is  <^+1  = tan  1 

A line  OP,  inclined  at  0 to  the  x-axis,  where  <|k  j:0^<(>£+1(Fig.U) , 
would  be  measured  by  a series  of  vectors  equivalent  in  Fig.  U to  OR  + RP. 
Since  (ORsin<|K  + RPsin<{K+^)  = OPsin0,  etc.,  it  readily  follows  that  in 
this  case  the  error  of  measurement  is 

e(0)  = ^ (r. (i+1)  - r.  (i))  cos0  - (r. (m-i+l)  - r.  (m-i))sin0  - m} 

m l l+l  i l+l 

2 2 2 
where  r.  = (m  - j)  + j . 

J 


Write  I.  «(*l+1 

1 k 


e(0)d0  where  <|>  = 0 and  <)>  = -r 

o m d 


Write  the  average  error  for  m-sampling  as  e (m) . 

Then  e(m)  = - (I  + I.  + . . . + I .. ) . 

it  o 1 m-1 

From  this  it  follows,  on  evaluating  the  integrals,  that 

'<”>  • f <2  - ; .1  <ri.i  - ri)2  - 1> 

1-0 


= - U - i m~l  t((m  - i+1)2  + (i+1)2) ^ - ((m-i)2  + i2)k2}  - 1 (3) 

^ 2*  i=0 

Value  of  the  average  relative  error  in  measurements  of  an  m-sampled  line 

Write  e (m)  = — E - 1 where  E is  obtained  from  eq.(3).  Table  1 
it  m m 

gives  values  of  e(m)  end  numerical,  forms  of  E for  various  veLLues  of  m. 

m 

It  is  interesting  to  note  that,  e(m)  being  always  positive,  the  average 
length  measured  increases  as  the  sampling  interval  falls  in  such  a way 
as  to  provide  a straight  line  in  a log. log  plot  of  e(m)  vs.  m (Fig. 5). 

This  closely  parallels  the  manner  in  which,  as  discussed  by  Mandelbrot  [3], 

# 

the  measured  length  of  a coastline  increases  as  the  measuring  rod  is 
reduced. 

As  expected,  e(m)  tends  to  zero  as  m becomes  very  large.  This  can 


8. 


be  proved  formally  by  referring  to  AOCA  in  Fig.  3 and  noting  that 
OA  = r^,  OC  = and  CA  = J2,  from  which  it  follows  that 

T1  * ri+l2  ' 2riri+l  - *!>  - 2 

or  (ri+1  _ r.)2  - 2 * 2Vi*l(cosUi*l  " *i>  _li 

As  m becomes  very  large,  (<|k+^  “ ) becomes  very  small.  In  these 

( d)  — r \ 2 

circumstances,  write  cos($..„  — 4> . ) — 1 = — ?i+l  ? i 

* l+l  l 

■-s^rr 

l l+l 

since  for  very  small  values,  - <J> . ) y sin(  <(>...  - <ji.)  = — , 

i i 'w  i **\L  i r . r . , _ 

i i+i 

Then  it  follows  that 


Lt  k l (ri*i  - r/  - 


2--f 


from  which,  immediately,  Lt  = jj-  and  Lt  e(m)  = 0. 


In  practice,  the  limiting  situation  would  be  finding  the  two  ends 
of  a straight  line  and  calculating  the  distance  between  them. 

It  is  worth  remarking  that  2-sampling  is  a version  of  8-way  coding 
in  which  the  diagonal  step  has  the  value  J 2 . This  case  has  been 

considered  by  Kulpa  [1* ] with  the  same  result  as  is  derived  here  from 
the  general  formula. 


5.  EXPERIMENTAL  VERIFICATION 


The  validity  of  several  of  the  formulae  given  in  the  preceding 
sections  was  confirmed  by  computer  program.  A straight  line  of  known 
length  but  with  random  orientation  was  digitized.  For  each  method  of 
measurement  the  line  (usually  of  length  200)  was  measured  at  fifty 


Freeman  [5]  (see  also  Groen  and  Verbeek  [6])  considers  the  relative 
frequency  of  unit  and  diagonal  steps  in  the  encoded  sequence  of  an 
arbitrary  straight  line.  He  reaches  his  result  with  the  assumption  that 
lines  emanate  from  each  encoded  point  with  uniformly  random  orientation. 
This  assumption  can  be  avoided  by  comparing  the  average  relative 
errors  obtained  in  the  measurement  of  straight  lines  encoded  by  the 
8-way  and  the  8(/2)-v&y  schemes.  The  difference  between  these 
average  errors  is  due  to  diagonal  steps  being  counted  as  of  length 
1 or  /2.  Suppose  that  in  the  encoded  "average  line",  of  true  length 
l,  there  are  n steps  of  which  a fraction  p are  diagonal.  Then,  with 
8-way  coding,  using  the  formula  for  average  relative  error 


7 = a zA 

e8  l 


2/2 


- 1 


and  with  8(/2)-way  coding 

r8(/2)  = n.ll.  - P)  + np/2  - l = ± 

r t 

Eliminating  n/ i yields  p = /2-1  which  is  the  result  of  Freeman  [5  ] and 
of  Groen  and  Verbeek  [6]. 

7.  CODING  EFFICIENCY 

Although,  in  particular  circumstances,  one  definite  coding  scheme 
may  be  more  convenient  than  others,  there  appears  to  be  no  general  rule 
by  which  the  merits  of  different  schemes  may  be  compared.  It  is 
nevertheless  possible  to  approach  such  a comparison  on  the  basis  of  the 
results  of  the  previous  sections. 


k 


To  begin  with,  consider  the  number  of  bits,  II,  which  must  be 

stored  in  the  encoding  of  a straight  line  under  the  coding  scheme 

chosen,  with  the  implication  that  the  smaller  the  number  of  bits  stored, 

the  higher  is  the  coding  efficiency.  For  a w-way  code  with  steps  of 

equal  length,  a straight  line  of  length  i will,  on  average,  be  encoded 

by  a sequence  of  A. ( 1 + e ) direction  vectors,  and  each  vector  will 

w 

require  loggW  bits  to  specify  it.  Hence  the  number  of  bits  required 
to  encode  the  line  is,  on  average, IJ  = £(l  + e^Jlcg^w.  Values  of 
are  given  in  Table  3 for  w = k,  6 and  6.  The  table  shows  that  t-way 
coding  requires,  on  average,  significantly  fewer  bits  to  be  stored  for 
the  encoding  of  a straight  line  than  do  either  of  the  other  two 
schemes . 

A reduction  ir.  the  amount  of  data  stored  can  be  brought  about  if 
the  h-ua.y  encoded  line  or  edge  is  m-sampled.  For  a straight  line  of 
length  £ the  k-way  encoding  results,  on  average,  in  2(l  + e^)  direction 
vectors.  If  this  sequence  is  m-sampled  (which  provides  ^m  direction 
vectors)  the  number  of  direction  vectors  is  reduced  to  1.(1  + e^)/m  but 
the  number  of  bits  required  to  specify  each  one  is  log0Um.  Hence  the 
number  of  bits  required  to  encode  the  n-sampled  line  is,  on  average, 

N = fc(l  + e1+)log2(Um)/m. 

f. 

Values  of  N /£  are  given  in  Table  3.  It  should  be  noted  that  these 
m 

values  are  for  an  m-sampled  t-way-encoded  straight  line;  for  lines 
which  initially  were  encoded  by  the  6-way  or  8-way  schemes , the  values 
of  N /£  would  be,  respectively  13.*+%  and  29.3%  smaller. 

It  can  be  seen  from  Table  3 that  m-sampled  data  of  an  encoded 


L 


straight  line  can  be  stored  efficiently,  even  when  allowance  is  made 
for  the  practical  consideration  that  bits  can  be  stored  only  as  integers. 


11. 
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The  m-sampling  of  an  encoded  straight  line  is,  of  course,  a special 
case  with  a limited  number,  l*m,  of  resulting  direction  vectors.  For 
an  arbitrarily  curved  line  for  which  the  original  close-neighbor  coding 
was  m-sampled,  more  directions  would  appear  for  each  value  of  m, 
resulting  in  a code  which  is  a variant  of  the  generalized  codes 
discussed  by  Freeman  [7].  The  form  of  such  a code  is  shown  in  Fig.  6(a) 
and,  in  Freeman's  notation,  specifying  an  m-sampled  arbitrary  curve  would 
require  a (a, . . . ,m-4,m-2,m)-code  where  a = 1 or  2 according  as  m is 
odd  or  even.  For  this  form  of  generalized  code  the  number  of  distinct 

p 2 

direction  vectors  is  (m  + l)  if  m is  odd  or  (m  + l)  -1  if  m is  even. 

In  the  accompanying  paper  [1 ] we  show  that  an  arbitrary  closed 

curve  of  length  i is  h-vay  encoded  by  l( 1 + e^)  direction  vectors. 

Accepting  this  result,  it  follows  that  if  the  coding  were  m-sampled, 

the  number  of  direction  vectors  would  be  N = (l  + e,  )log0(m  + l)  /m 

m 4 c 

if  m is  odd  or  the  corresponding  quantity  if  m is  even.  Table  3 
includes  values  of  and  the  advantage  of  m-sampling  in  terms  of 

data  storage  requirements  is  readily  seen.  However,  this  mechanical 
method  of  reducing  the  quantity  of  stored  data  risks  losing  significant 
features  of  the  encoded  curve.  In  Fig.  6(b)  if  A were  a sampling  point 
and  with  m*6,  the  protrusion  to  the  right  of  A would  disappear.  To 
avoid  such  loss  of  features  it  would  be  necessary  to  allow  m to  vary 
and  the  value  of  N^/il  would  increase  accordingly. 

Data  storage  requirements  are  not  the  only  considerations  which 
apply  in  choosing  a scheme  to  encode  a line  or  edge.  Two  other  obvious 
properties  of  lines  are  length  and  shape.  We  have  shown  above  that  a 
simple  algorithm  (a  general  correction  factor  together  with  an  allowance 
for  corners)  may  be  used  to  reduce  on  average  to  zero  the  measurement 
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error  in  determining  the  length  of  a straight  line  directly  from  its 
4-way  encoding.  This  algorithm,  moreover,  is  applicable  in  determining 
the  lengths  of  noise-free  curved  lines  [1].  It  thus  appears  that 
considerations  of  metrication  hardly  affect  the  quantification  of  the 
efficiency  of  4-way  coding.  It  should  be  mentioned  that  the  algorithm 
used  with  length  measurement  of  4-way  encoded  lines  could  be  applied, 
with  different  values  of  the  correction  factors,  to  other  coding  schemes. 
The  average  relative  error  could  not,  of  course,  be  further  reduced, 
but  the  standard  deviation  of  a series  of  measurements  could  be. 

The  shape  of  a line,  even  in  a local  region,  is  an  undefined  or 
imprecisely  defined  property  unless  the  line  is  straight  or  is  of  a 
simple  geometric  shape  (circular,  parabolic,  etc.).  If,  at  least  in 
a local  region,  the  line  is  continuous  and  differentiable,  its  curvature 
at  a point  may  be  obtained.  For  a digitized  encoded  line,  a form  of 
discrete  curvature  may  be  specified  at  each  point  from  the  two  direction 
vectors  involving  the  point  [8,  91 » or  sections  of  the  line  extending 
over  many  points  may  be  approximated  by  circular  arcs  [10].  However, 
extracting  such  features  of  a line  (11  - 13]  involves  procedures  which 
average  (one  or  more  times,  cf.  [9])  over  several  points  of  the  encoded 
sequence;  and  the  more  points  included  in  the  averaging  the  smaller 
the  dependence  on  the  coding  scheme . In  these  circumstances , and  even 
though  intuition  suggests  that  shape  is  best  encoded  by  the  available 
coding  scheme  with  most  direction  vectors,  no  measure  of  efficiency 
is  available  nor  does  practical  experience  suggest  that,  say,  4-way 
coding  is  worse  than  8-way  coding. 
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8.  CONCLUSIONS 

If  a straight  line  or  edge  is  encoded  as  a sequence  of  points, 
the  simplest  way  of  measuring  the  length  of  the  line  is  to  count  the 
number  of  steps,  each  of  known  length,  through  the  sequence.  The 
smaller  the  number  of  direction  vectors  provided  by  the  coding  scheme, 
the  greater  is  the  average  error  in  determining  length  by  this  means 
(Fig. 5);  and  with  U-way  coding  the  average  error  is  just  over  27/5  and 
the  standard  deviation  of  a series  of  measurements  is  just  over  12?. 
However,  a simple  algorithm  reduces  the  average  error  to  zero  and  the 
standard  deviation  to  less  than  2.5 ?.  It  is  necessary  to  go  to  a 
20-way  code  (5-sampling  of  U-way  encoding)  before  the  average  error  of 
length  measurement  falls  below  1?,  although  the  standard  deviation  is 
down  to  2.5#  with  8(>^)-way  coding  (2-sampling  of  U-way  encoding) 

(Table  2). 

In  terms  of  data  storage,  U-way  encoding  requires  significantly 
less  capacity,  on  average,  than  the  other  close-neighbor  coding  schemes. 
Sampling  the  U-way  encoded  data  at  regular  intervals  for  either  a straight 
line  or  an  arbitrary  closed  curve  greatly  reduces  the  necessary  storage 
capacity  (Table  3)  but  for  an  arbitrary  curved  line  this  advantage  is 
offset  by  the  risk  of  losing  significant  shape  features.  To  avoid 
such  loss  it  would  be  necessary  to  sample  at  irregular  intervals  and 
this  would  cause  an  increase  in  the  number  of  direction  vectors  and  the 
number  of  bits  stored.  Quite  generally,  details  of  shai/e  are  best 
preserved  by  coding  with  short  direction  vectors  and,  in  the  limit, 
by  using  close-neighbor  coding  schemes . 


i 
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We  conclude  that,  of  the  close-neighbor  schemes,  U-way  coding  is 
the  most  efficient.  However,  more  generalized  coding  schemes,  apart 
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from  reducing  the  number  of  bits  to  be  stored,  have  the  advantage 
pointed  out  by  Freeman  and  Saghri  [?]  of  involving  less  processing 
time  in  the  further  analysis  which  might  be  undertaken  of  the  line 
or  boundary. 
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Table  1.  Exact  formulae  and  average  errors  in  in-sampling 
of  straight  lines 


No.  of 

direction 

vectors 

E 

m 

e(m) 

1+ 

1 

0.27321+ 

8 

1 ~ 

(2  - /2)2/2 

0.051+79 

= 2/2 

- 2 

12 

1 - 

(3  - /5)2/3 

0.02556 

= 2/5  - 11/3 

fc  16  1 - {(U  - /Eo)2  + (/Eo  - /g)2}/i+  o.oH+38 

= 2/10  - 10  + 2/5 

5 20  1 - {(5  - /f?)2  + (/17  - /l3)2}/5  0.00922 

= 2 /IT  - 67  + 2/221 
5 5 

6 2l+  1 - {(6  - /26)2  + (/2?  - /20)2  + (/20  - /l8)2}/6  0.0061+1 

= 2/26  - 70  + 2/130  + 2/16 
3 3 

7 28  1 - {(7  - /37)2  + (/37  - /?9)2  + (/29  - 5)2)/7  0.001+71 

= 2/37  - 12£  + 2/1073  + 10/29 
7 7 7 

10  40  0.00231 


Table  2 


Experimental  verification  of  error  formulae 


Coding  scheme 


l+-way 

6-way 

8-way 

8(/2)-way 


2- sampled 

3- sampled 
l*-sampled 
5-sampled 
10- sampled 
20-sampled 


Theoretical  values 
e o 

Close  neighbor  codes 
0.273  0.121* 

0.103  0.01+6 

- 0.100  0.088 

0.055  0.025 

m-sampled  data 
0.055 
0.026 
0.011* 

0.0092 

0.0023 

0.00058 


Experimental 
av . error 


0.277 

- 0.081 

0.053 

0.053 

0.025 

0.011+ 

O.OO89 

0.0021 

0.00053 


corner-corrected  l*-way  encoding 


O.OOO88 


values 
s .d. 

0.120 

0.077 

0.027 

0.027 

0.015 

0.0087 

0.0057 

0.0011+ 

0.00036 

0.021+ 


0 


0.023 


Table  3.  Number  of  bits  per  unit  length  required  to  encode  a line 


STRAIGHT  LINES  ARBITRARY  CLOSED  CURVES 

Coding  scheme  Theoretical  Practical  Theoretical  Practical 


Close-neighbor  codes 


l*-vay 

2.5U6 

2.51*6 

6-way 

2.850 

3.308 

8-way 

2.701 

2.701 

m-samcled  l*-way-encoded  data 

2-sampled 

1.910 

1.910 

1.910 

1.910 

3-sampled 

1.522 

1.698 

1.698 

1.698 

l*-s ample d 

1.273 

1.273 

1.1*59 

1.592 

5 -sampled 

1.101 

1.273 

1.317 

1.528 

6-sampled 

0.973 

1.061 

1.185 

1.273 

7-sampled 

0.87*4 

0.909 

1.091 

1.091 

10-sampled 

0.678 

0.761* 

0.879 

0.891 

20-sampled 

0.U02 

0.1*  1+6 

0.559 

0.573 
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